L. Moench, O. Rose, eds. On Step Sizes, Stochastic Shortest Paths, and Survival Probabilities in Reinforcement Learning
Authors
Abstract
Reinforcement learning (RL) is a simulation-based technique for solving Markov decision processes whose transition probabilities are not easily obtainable or whose state spaces are very large. We present an empirical study of (i) the effect of step sizes (learning rules) on the convergence of RL algorithms, (ii) the use of stochastic shortest paths in solving average-reward problems via RL, and (iii) the notion of survival probabilities (downside risk) in RL. We also study the impact of step sizes when function approximation is combined with RL. Our experiments yield insights that should be useful in practice when RL algorithms are implemented within simulators.
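To make the role of the step size concrete, the following is a minimal sketch of tabular Q-learning run against a simulator under three candidate step-size rules. It is an illustration only, not the authors' experimental setup: the two-state MDP (`P`, `R`, `GAMMA`), the constants `a` and `b`, the shifted logarithmic rule, the use of per-pair visit counts as the index `k`, and the choice of a discounted-reward update are all assumptions made for this example.

```python
import math
import random

# Candidate step-size (learning-rate) rules.
# k is the 1-based visit count of a state-action pair (an assumption of this sketch).
def step_reciprocal(k):
    return 1.0 / k

def step_ratio(k, a=150.0, b=300.0):
    # a and b are hypothetical tuning constants.
    return a / (b + k)

def step_log(k):
    # Shifted by one so the step size is nonzero at k = 1.
    return math.log(k + 1) / (k + 1)

# Hypothetical 2-state, 2-action MDP used only for illustration.
# P[s][a] is the probability of moving to state 0; R[s][a] is the reward.
P = [[0.7, 0.9], [0.4, 0.2]]
R = [[6.0, 10.0], [7.0, 14.0]]
GAMMA = 0.8  # discount factor

def q_learning(step_rule, iterations=20000, seed=0):
    """Tabular Q-learning with uniform random exploration."""
    rng = random.Random(seed)
    q = [[0.0, 0.0], [0.0, 0.0]]
    visits = [[0, 0], [0, 0]]
    s = 0
    for _ in range(iterations):
        a = rng.randrange(2)
        # Simulate one transition instead of using P in a Bellman backup.
        s_next = 0 if rng.random() < P[s][a] else 1
        visits[s][a] += 1
        alpha = step_rule(visits[s][a])
        target = R[s][a] + GAMMA * max(q[s_next])
        q[s][a] += alpha * (target - q[s][a])
        s = s_next
    return q

if __name__ == "__main__":
    for rule in (step_reciprocal, step_ratio, step_log):
        print(rule.__name__, q_learning(rule))
```

Running the script prints the learned Q-values for each rule after the same number of simulated transitions, which gives a simple way to compare how quickly the different rules settle on this toy problem.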
Similar resources
An Efficient Method for Selecting a Reliable Path under Uncertainty Conditions
In a network that has the potential to block some paths, choosing a reliable path, so that its survival probability is high, is an important and practical issue. The importance of this issue is very considerable in critical situations such as natural disasters, floods and earthquakes. In the case of the reliable path, survival or blocking of each arc on a network in critical situations is an un...
Predisaster Preparation of Transportation Networks
We develop a new approach for a pre-disaster planning problem which consists in computing an optimal investment plan to strengthen a transportation network, given that a future disaster probabilistically destroys links in the network. We show how the problem can be formulated as a non-linear integer program and devise an AI algorithm to solve it. In particular, we introduce a new type of extrem...
Intersection properties of Brownian paths
This review presents a modern approach to intersections of Brownian paths. It exploits the fundamental link between intersection properties and percolation processes on trees. More precisely, a Brownian path is intersect-equivalent to certain fractal percolation. This means that the intersection probabilities of Brownian paths can be estimated up to constant factors by survival probabilities of ...
Evaluation of Multiagent Search Performance Revised Proposal & Midterm Review
Pathfinding, the simple process of finding a route from one point to another, comes with ease to humans as well as animals and is as essential to survival as it is to convenience. On the other hand, it is a remarkably difficult task to replicate in the artificial world. Because it is essential to numerous technological applications, notably autonomous locomotion of mobile robots, movement of agents ...
k-Survivability: Diversity and Survival of Expendable Robots
We define the k-survivability of a set of n paths as the probability that at least k out of n robots following those paths through a stochastic threat environment reach their goals. High k-survivability sets tend to contain short and diverse paths. Finding sets of paths with maximum k-survivability is NP-hard. We design two algorithms: a complete algorithm that finds an optimal list of paths, and a he...
Publication date: 2009